Rank in Wordlist | Frequency | Word |
---|---|---|
1156 | 3 | 1,5 |
1163 | 3 | 2,2 |
1633 | 2 | 1,6 |
2871 | 1 | 0,25 |
2872 | 1 | 0,6 |
2885 | 1 | 1,4 |
2886 | 1 | 1,8 |
2892 | 1 | 10,3 |
2893 | 1 | 10,5 |
2916 | 1 | 16,4 |
Rank in Wordlist | Frequency | Word |
---|---|---|
3700 | 1 | F*&n |
4010 | 1 | H&M |
Rank in Wordlist | Frequency | Word |
---|---|---|
1770 | 2 | God's |
3504 | 1 | DYRSKU'N |
3755 | 1 | Femmer'n |
4902 | 1 | PC'en |
5225 | 1 | Sheriff's |
6276 | 1 | by'n |
7315 | 1 | i'en |
Rank in Wordlist | Frequency | Word |
---|---|---|
3700 | 1 | F*&n |
Rank in Wordlist | Frequency | Word |
---|---|---|
1992 | 2 | Tips/bilder |
2377 | 2 | km/t |
3421 | 1 | Byggeindustrien/Bygg |
4021 | 1 | HOLMESTRAND/OSLO |
4597 | 1 | Manglerud/Star |
4672 | 1 | Moldstad/Stormark |
4751 | 1 | Nest/Sotra |
5132 | 1 | SMS/MMS |
5361 | 1 | Statoil/Hydro-fusjon |
5454 | 1 | TV/videogutten |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots